Wideband Speech Recovery Using Psychoacoustic Criteria

نویسندگان

  • Visar Berisha
  • Andreas Spanias
چکیده

Manymodern speech bandwidth extension techniques predict the high-frequency band based on features extracted from the lower band.While this method works for certain types of speech, problems arise when the correlation between the low and the high bands is not sufficient for adequate prediction. These situations require that additional high-band information is sent to the decoder. This overhead information, however, can be cleverly quantized using human auditory system models. In this paper, we propose a novel speech compression method that relies on bandwidth extension. The novelty of the technique lies in an elaborate perceptual model that determines a quantization scheme for wideband recovery and synthesis. Furthermore, a source/filter bandwidth extension algorithm based on spectral spline fitting is proposed. Results reveal that the proposed system improves the quality of narrowband speech while performing at a lower bitrate. When compared to other wideband speech coding schemes, the proposed algorithms provide comparable speech quality at a lower bitrate.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Wideband coding of speech using neural network gain adaptation

In this paper, a high-quality wideband speech coder is proposed. The coding structure resembles a LD-CELP coder, however, several novel improvements are made. The gain adapter for the stochastic codebook is driven by a neural network and it updates the excitation gain in a sample-by-sample fashion. The purpose of incorporating a neural network is to exploit both the intraand inter-frame correla...

متن کامل

Noise Suppression using a Perceptual Model for Wideband Speech Signals

Traditional algorithms for suppressing background noise in speech signals can add annoying artefacts to the resulting denoised signal. In applications requiring better than toll quality, it is desirable that noise suppression should not add any audible artefacts. This paper describes a method that is effective for narrowband and applies these methods to wideband signals. The method presented us...

متن کامل

Low Distortion Acoustic Noise Suppression Using a Perceptual Model for Speech Signals

Algorithms for the suppression of acoustic noise in speech signals are generally Short-Time Spectral Amplitude (STSA) methods such as Spectral Subtraction. These methods have been effective at reducing or removing the background noise, but have a tendency (at low SNR) to add annoying artefacts, such as musical noise, and distortion of the speech signal. By employing an auditory model, psychoaco...

متن کامل

Perceptual Models for Speech, Audio, and Music Processing

New understandings of human auditory perception have recently contributed to advances in numerous areas related to audio, speech, and music processing. These include coding , speech and speaker recognition, synthesis, signal separation , signal enhancement, automatic content identification and retrieval, and quality estimation. Researchers continue to seek more detailed, accurate, and robust ch...

متن کامل

Wideband Speech Recovery from Narrowband Speech Using Classified Codebook Mapping

Speech sounds occupy 8 kHz or more of bandwidth. However, current public telephone networks limit the speech bandwidth to 300–3400 Hz. Telephone speech is characterized by thin and muffled sounds, and degraded speaker identification. We describe an algorithm which generates the missing highband components from the narrowband speech signal. The algorithm is based on three acoustic-phonetic class...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • EURASIP J. Audio, Speech and Music Processing

دوره 2007  شماره 

صفحات  -

تاریخ انتشار 2007